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ARI  Research  Reports  and  Technical  Reports  are  intended  (or  sponsors  ot 
R&O  tasks  and  for  other  research  and  military  agencies.  Any  findings  ready 
for  implementation  at  the  time  of  publication  are  presented  in  the  last  part 
of  the  Brief.  Upon  completion  of  a  major  phase  of  the  task,  formal  recom¬ 
mendations  for  official  action  normally  are  conveyed  to  appropriate  military 
agencies  by  briefing  or  Disposition  Form. 


FOREWORD 


The  Manpower  and  personnel  Research  Laboratory  of  the  U.S.  Army  Research 
Institute  for  the  Behavioral  and  Social  Sciences  (ARI)  conducts  research  in 
areas  related  to  manpower  and  personnel  issues.  One  key  issue  is  reading 
performance  of  Army  soldiers  as  shown  through  reading  assessment.  Reading 
assessment  has  become  an  important  problem  not  just  for  training  but  also  for 
personnel  selection  and  classification.  This  report  addresses  Army  reading 
assessment  from  a  theory  base  that  is  both  current  and  useful.  The  research 
was  conducted  under  ARI  Project  Number  2Q263731A791.  The  information  reported 
here  will  be  helpful  to  policymakers  and  scientists  concerned  with  military 
selection,  classification,  training,  and  education. 


EDGAR  M.  JOHNSON 
Technical  Director 
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A  THEORY-BASED  APPROACH  TO  READING  ASSESSMENT  IN  THE  ARMY 


EXECUTIVE  SUMMARY 


Requirement: 

To  address  practical  Army  problems  in  reading  assessment  from  a  theory 
base  that  reflects  the  most  recent  and  most  sound  research  on  reading 
comprehension. 


Procedure: 

Specific  aspects  of  the  Army  reading  assessment  problem  were  identified. 
The  most  current  and  relevant  reading  theory  and  its  implications  for  reading 
assessment  in  general  were  then  explained.  Finally,  the  relationship  of  those 
implications  to  Army  reading  assessment  was  described. 


Pindings: 

Six  major  conclusions  are  drawn  from  both  theory  and  practice.  First, 
reading  is  important  in  military  and  civilian  work  life.  Second,  reading 
assessment  is  a  highly  visible  and  important  issue  in  the  Army.  Third, 
reading  theories,  especially  the  new  interactive-inferential  theory,  can 
positively  influence  reading  measurement  practices  in  the  Army.  Fourth, 
reading  tests  differ  widely  in  terms  of  psychometric  characteristics  and 
overall  quality  as  evaluated  by  theory-based  standards.  Fifth,  the  Armed 
Services  Vocational  Aptitude  Battery  (ASVAB)  and  various  reading  tests  have  a 
high  correlation,  although  caution  needs  to  be  exercised  in  substituting  any 
part  of  the  asvab  for  a  reading  test.  Sixth,  alternatives  to  grade  equivalent 
scores  are  available  and  should  be  considered  for  use  In  the  Army. 


Utilization  of  Pindings: 

This  report  has  utility  for  Army  selection,  classification,  training,  and 
education.  The  findings  nay  also  be  helpful  to  other  military  services  con¬ 
cerned  with  reading  assessment  issues. 
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A  THEORY-BASED  APPROACH 
TO  READING  ASSESSMENT  IN  THE  ARMY 


The  U.S.  Army  Research  Institute  for  the  Behavioral  and  Social  Sciences 
(ARI)  has  been  involved  in  research  on  reading  assessment  in  the  Army  from 
both  practical  and  theoretical  perspectives.  The  purpose  of  this  paper  is  to 
address  practical  Army  problems  in  reading  assessment  from  a  theory  base  that 
reflects  the  most  recent  and  most  sound  research  on  reading  comprehension. 

This  paper  has  seven  sections.  The  first  two  sections,  which  concern  the  im¬ 
portance  of  reading  in  work  life  and  the  importance  of  reading  assessment  in 
the  Army,  provide  the  basis  for  addressing  the  overall  issue  of  reading  as¬ 
sessment  in  the  Army.  The  reading  theories  presented  in  the  third  section  are 
helpful  in  generating  evaluative  criteria  for  reviewing  potential  measures  of 
reading  for  Army  use.  In  the  fourth  and  fifth  sections,  three  categories  of 
measures  are  considered:  civilian,  military,  and  the  special  case  of  the 
Armed  Services  Vocational  Aptitude  Battery  { ASVAB ) — a  military  test  of  general 
cognitive  ability  with  potential  value  as  a  surrogate  reading  measure.  The 
sixth  section  deals  with  scoring  issues  that  need  to  be  addressed  no  matter 
what  measure  is  used.  The  final  section  offers  conclusions  based  on  both 
theory  and  practice. 


READING  IS  IMPORTANT  IN  MILITARY  AND  CIVILIAN  WORK  LIFE 

Military  research  on  reading  has  shown  that  reading  is  an  important  part 
of  military  life.  Sacher  and  Duffy  (1978)  found  that  military  workers'  over¬ 
all  job  performance  suffered  if  they  read  more  than  two  grade  levels  below  job 
demands.  Sticht  (1980)  observed  a  low  generalizability  of  general  reading 
skills  to  specific  literacy  tasks  in  the  military  and  emphasized  the  need  for 
job-related  reading  tests,  Sticht  (1982)  also  investigated  empirical  rela¬ 
tions  between  reading  proficiency  and  job  proficiency  in  the  military  setting 
and  found  that  the  armed  services  pose  demands  for  basic  skills  equal  to  or 
greater  than  those  of  civilian  jobs.  Even  with  higher  enlistment  standards, 
many  military  recruits  are  below  awrage  in  basic  skills,  including  not  only 
reading  but  also  other  skills.  However,  Sticht  (1982)  found  that  basic  skills 
competence  as  measured  by  a  variety  of  tests  did  not  appear  to  be  the  over¬ 
riding  determinant  of  success  in  the  military.  The  precise  relationship  be¬ 
tween  reading  skills  and  job  skills  needs  further  investigation. 

In  addition  to  military  research,  civilian  research  may  have  some  im¬ 
plications  for  Army  literacy.  Diehl  and  Mikulecky  (1980)  observed  100  workers 
representing  a  cross-section  of  occupations  and  found  that  90%  of  the  workers 
participated  in  some  form  of  reading  each  day.  In  a  later  study,  Mikulecky 
(1982)  compared  high  school  reading  to  work  reading  and  found  that  students 
read  less  for  school  than  most  workers  did  for  work.  Workers  read  more  often 
for  application  or  to  make  judgments,  while  students  read  to  gather  facts. 
Mikulecky  and  Strange  (in  preparation)  noted  that  the  workplace  calls  for  a 
wider  range  of  literacy  strategies  than  does  school  literacy,  95%  of  which  is 
based  on  textbooks.  These  civilian  research  results  may,  by  implication, 
underscore  the  importance  of  reading  in  many,  if  not  all,  military  jobs. 


READING  ASSESSMENT  IS  IMPORTANT  IN  THE  ARMY 


In  the  last  few  years  reading  assessment  has  become  a  key  issue  in  the 
Army  primarily  because  of  changes  in  the  reading  ability  levels  of  the  Army 
population.  Population  shifts  are,  in  turn,  related  to  a  multitude  of  fac¬ 
tors,  such  as  changes  in  the  national  economic  picture  and  variations  in 
norming  of  the  ASVAB  used  for  Army  selection  and  classification.  In  recent 
years  the  Training  and  Doctrine  Command  (TRADOC),  the  Forces  Command  (PORSCOM), 
the  Education  Directorate  of  The  Adjutant  General's  Office  (TAGO),  and  the 
Office  of  the  Deputy  Chief  of  Staff  for  Personnel  (ODCSPER)  have  all  turned  to 
ARI  for  practical  advice  on  reading  assessment.  For  example,  both  TRADOC  and 
PORSCOM  have  asked  for  ARI's  help  in  measuring  the  reading  level  of  noncom¬ 
missioned  officers  (NCOS).  TRADOC  Systems  Analysis  Activity  (TRASANA)  has 
requested  assistance  in  interpreting  its  data  on  reading  tests  of  officers  and 
enlisted  personnel.  TRADOC  is  currently  establishing  a  program  to  improve 
officers'  communication  skills  and  has  asked  for  ARI's  advice  on  the  use  of 
reading  tests  as  part  of  thu  program.  TAGO  is  funding  an  ARI  project,  known 
as  the  Job  Skills  Education  Program  (JSEP),  a  computer-based  effort  that  in¬ 
volves  the  assessment  of  job-oriented  basic  skills.  ARI  has  also  advised 
ODCSPER  on  the  difficulties  of  using  reading  as  a  reenlistment  criterion  for 
midterm  noncommissioned  officers. 

Army  agencies  often  ask  the  following  questions  about  reading  assess¬ 
ment:  "What  reading  tests  can  we  use?*  *What  do  our  reading  test  scores 
mean?*  and  ‘Since  ASVAB  scores  relate  to  reading  test  scores,  can  the  ASVAB  bp 
used  as  a  substitute  for  a  reading  measure?*  Recent  requests  for  assistance 
have  not  focused  on  the  more  central  question,  *How  does  reading  proficiency 
relate  to  job  proficiency?*  The  last  question--perhapc  the  most  crucial  ques¬ 
tion  for  Army  reading  assessment — is  an  empirical  one  that  research  has  not 
yet  answered  adequately.  However,  both  theory  and  research  provide  answers  to 
the  first  three  questions.  Before  addressing  these  questions  in  detail,  we 
will  summarize  several  key  types  of  reading  theories  and  indicate  their  impli¬ 
cations  for  reading  assessment. 


THREE  TYPES  OF  READING  THEORIES  ARE  EXAMINED 

Many  theories  of  reading  exist.  One  theory  unites  perceptual  and  cog¬ 
nitive  elements.  A  second  set  of  theories  relates  to  information  processing. 
A  third  theory,  the  most  recent,  concerns  inferencing  in  an  interactive  mode. 
VJe  will  describe  the  essence  of  these  theories  and  their  implications  for 
reading  assessment.  The  greatest  emphasis  is  on  the  last  theory  because  of 
its  major  implications  for  reading  assessment. 

Perceptual-Cognitive  Theory  Involves  strategies 

Gibson  and  Levin  (1975)  presented  a  theory  that  is  both  perceptual  and 
cognitive.  Although  they  stressed  most  heavily  the  perceptual  underpinnings 
of  the  reading  process,  these  researchers  also  indicated  that  psychological 
processes  of  mature  reading  go  far  beyond  perception  to  remembering,  problem 
solving,  and  e  v^nization  of  conceptual  knowledge  for  better  extraction  of 


2 


meaning.  Two  implications  of  perceptual-cognitive  theory  for  reading  assess¬ 
ment  are  (1)  the  cognitive  strategies  that  help  people  improve  their  reading 
can  also  be  used  to  help  people  perform  better  on  reading  teBts,  and  (2)  these 
strategies  are  not  innate  but  can  be  taught. 

Information  Processing  Theories  Analyze  Reading  Components 

As  noted  by  Carpenter  and  Just  (in  preparation),  information  processing 
is  characterized  by  efforts  to  understand  what  information  is  represented  in 
memory,  what  information  is  acquired,  how  processes  are  acquired  and  invoked, 
how  long  these  processes  take,  and  what  sources  of  error  exist.  A  major  con¬ 
tribution  of  information  processing  theory  is  that  it  acknowledges  the  role  of 
environment  (not  just  previously  acquired  knowledge  and  thought  patterns)  in 
explaining  reading  behavior.  The  information  processing  model  of  Sticht,  Beck, 
Hauke,  Kleinian,  and  James  (1974)  emphasizes  the  interaction  between  the  envi¬ 
ronment  and  cognitive  components  such  as  sensory  information  storage,  short¬ 
term  memory,  and  long-term  memory  to  generate  literacy  skills.  Hassaro’e 
(1975)  information  processing  model  of  reading  delineates  four  component 
processes  or  stages  between  the  language  stimulus  and  the  meaning  response: 
feature  detection  (sensation),  primary  recognition  (perception),  secondary 
recognition  (conception),  and  recoding  and  rehearsal.  Information  processing 
analyzes  reading  into  component  processes;  however,  the  parts  are  not  always 
easily  reconstituted  into  the  reading  act  (Carpenter  fc  Just,  in  preparation). 
Information  processing  theory  is  now  being  used  in  computer-reading  simula¬ 
tions,  in  which  successful  computer  programs  must  have  a  great  deal  of  'knowl¬ 
edge*  about  vocabulary,  language  structure,  and  the  topic  of  the  text.  Car¬ 
penter  and  Just  (in  preparation)  described  aome  recent  computer  simulations  of 
reading  based  on  information  processing. 

Information  processing  theories  of  reading  have  some  implications  for 
reading  assessment,  first,  these  theories  imply  that  environment  and  short- 
and  long-term  memory  are  very  important  in  the  reading  process  and  that  there¬ 
fore  reading  is  not  a  simple  thing  to  measure.  Second,  one  application  of 
information  processing  theory  (that  of  Sticht  et  al.,  1974)  has  shown  that 
reading  tests  need  to  be  functional  (i.e.»  related  to  the  person’s  environ¬ 
ment)  for  results  to  be  most  meaningful. 

Interactive-Inferential  Theory  Provides  a  New  Perspective 

Although  the  emerging  reading  theory  lacks  a  consistent,  descriptive 
name,  we  will  call  it  interactive- inf erential  theory.  This  theory  builds 
largely  on  the  information  processing  model  but  alro  goes  beyond  it  by 
demonstrating  the  interaction  between  higher  order,  interpretive,  metacog- 
nitive  processes  and  less  complex  processes.  Interactive-inferential  theory 
focuses  on  interactive  rather  than  linear  movement  in  reading  and  has  gener¬ 
ated  a  spate  of  research  that  centers  on  reading  comprehension  instead  of 
decoding.  We  devote  more  time  to  interactive- inferential  theory  than  to 
earlier  theories,  because  it  offers  a  more  'evolved'  view  and  a  numoer  of 
implications  for  reading  assessment.  According  to  this  theory,  reading  (1)  is 
an  active  process  in  which  the  reader  constructs  meaning  through  inference  and 
interpretation,  (2)  is  purposeful  and  hence  involves  motivation,  (3)  can  be 


3 


improved  through  use  of  cognitive  and  learning  strategies,  (4)  is  processed  by 
the  individual  through  interactions  among  several  levels  of  information,  and 
(5)  can  be  divided  into  a  set  of  subskills  that  fit  into  a  unified  process  but 
that  are  not  as  yet  fully  defined  or  understood  (Parr,  Carey,  &  Tone,  in  prep¬ 
aration?  J.  Orasanu,  personal  communication,  January  12,  1984).  Each  of  these 
characteristics  of  reading  is  discussed  below,  along  with  its  implications  for 
reading  assessment. 

Action,  interpretation,  and  inference.  The  reader  actively  constructs 
meaning  from  text  cues  by  calling  on  knowledge  of  language,  text  structure, 
writing  conventions,  and  the  topic  itself.  The  reader  applies  inference  and 
interpretation  to  go  beyond  what  is  given  (Parr,  Carey,  a  Tone,  in  prepara¬ 
tion).  Researchers  have  studied  text  cues  through  methods  of  text  analysis 
and  text  linguistics.  The  active,  inferential,  interpretive  nature  of  reading 
creates  at  least  seven  implications  for  reading  assessment. 

First,  reading  comprehension  tests  may  be  most  valid  when  they  are  func¬ 
tional,  that  is,  when  they  involve  materials  similar  to  those  needed  to  per¬ 
form  everyday  and  vocational  tasks  (Hiller,  1973).  'Reading  assessment  ought 
to  reflect  the  schema  domains,  syntax,  vocabulary,  style,  and  structure  of 
materials  that  will  need  to  be  read  by  the  individuals  taking  the  tests' 

(Farr,  Carey,  &  Tone,  in  preparation,  p.  24).  As  a  consequence,  job-relevant 
reading  tests  of  a  criterion-referenced  nature  (i.e.,  keyed  to  a  specified 
criterion  or  standard  of  performance)  may  be  more  useful  for  Army  populations 
than  are  other  kinds  of  reading  tests,  at  least  for  the  purpose  of  assessing 
comprehension.  However,  if  a  test  ia  given  with  the  intent  of  ranking  group 
members  and  not  with  the  intent  of  carefully  assessing  comprehension,  then  it 
might  be  acceptable  to  use  a  norm-referenced,  general  reading  test  that  is  not 
job  related. 

Second,  in  order  to  employ  a  somewhat  uniform  cognitive  schema,  reading 
test6  need  to  be  developed,  tested,  and  normed,  using  populations  similar  to 
those  who  will  take  the  test.  For  example,  an  adult  reading  test  should  not 
be  developed  using  just  children,  and  items  developed  for  children  may  not  be 
relevant  for  adults.  These  facts,  while  seemingly  straightforward,  are  over¬ 
looked  with  surprising  frequency. 

Third,  the  active,  inferential,  interpretive  characteristic  of  the  new 
theory  implies  that  tests  should  be  appropriate  to  the  examinees  insofar  as 
possible.  Therefore,  adaptive  (tailored)  testing  may  be  useful.  Adaptive 
testing  allows  the  individual  to  take  only  items  that  are  at  a  relevant  dif¬ 
ficulty  level  and  may  shorten  test  administration  time  (McBride,  1979).  The 
joint  services  are  now  designing  and  testing  for  the  ASVAB  computerized  adap¬ 
tive  testing,  which  could  be  used  for  adult  reading  tests  as  well. 

Fourth,  because  readers  of  various  cultural  backgrounds  can  interpret 
reading  passages  differently  and  come  to  different  conclusions,  test  bias 
(known  as  'differential  validity')  can  occur  unless  background  knowledge  is 
controlled  or  explained.  However,  because  reading  comprehension  depends 
greatly  on  background  knowledge,  such  knowledge  cannot  be  artificially  fac¬ 
tored  out  or  eliminated  by  using  esoteric  content  (Farr,  Carey,  i  Tone,  in 
preparation) . 
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Pifth,  reading  tests  of  literal  recall  are  not  as  revealing  or  lifelike 
as  reading  tests  that  demand  some  sort  of  inferencing.  Old-fashioned  tests  of 
literal  recall  are  therefore  to  be  avoided. 

Sixth,  the  theory's  encouragement  of  relevance  of  the  teat  to  the  exami¬ 
nee  indicates  that  child-based  scores  such  as  reading  grade  levels  (grade 
equivalents)  are  not  useful  for  adults,  although  those  scores  are  ubiquitous 
in  the  armed  forces  (Parr,  Carey,  (  Tone,  in  preparation;  Hiller,  1973). 

Seventh,  text  linguistics  as  used  in  the  new  theory  can  provide  guidance 
on  how  to  create  more  reliable  reading  test  items,  that  is,  items  that  accu¬ 
rately  and  consistently  measure  the  skill  in  question.  Sample  guidelines 
include  avoiding  items  that  require  stylistic  and  other  ambiguous  judgment; 
not  testing  for  incidental,  insignificant  information;  not  using  harder  vo¬ 
cabulary  in  questions  than  in  text;  and  avoiding  list-like  density  of  ideas  in 
test  items  (J.  Orasanu,  personal  communication,  March  9,  1984). 

In  addition  to  the  active,  inferential,  interpretive  aspect  of  the  new 
theory,  other  aspects— purposefulness,  cognitive  strategies,  nonlinearity,  and 
subskills — also  have  Implications  for  reading  assessment. 

Purposefulness.  The  fact  that  reading  is  purposeful  and  involves  moti¬ 
vation  implies  that  reading  test  developers  should  select  and  present  test 
passages  that  engage  the  reader  in  a  valid  purpose,  not  just  the  purpose  of 
passing  the  test  (Parr,  Carey,  fc  Tone,  in  preparation).  The  interest  level 
and  degree  of  relevance  must  be  kept  high  to  maintain  the  reader's  sense  of 
purpose.  A  negative  sense  of  purpose  (e.9.,  "If  I  don't  pass  this  reading 
test  I  may  not  be  allowed  to  reenlls*:*)  may  have  a  detrimental  effect  on  the 
performance  of  some  readers  but  not  of  others.  Functional  reading  tests  might 
instill  a  greater  sense  of  purpose  than  more  general  reading  tests  do. 

Cognitive  strategies.  An  implication  of  the  existence  of  cognitive  stra¬ 
tegies  is  that  such  strategies  can  improve  not  only  reading  in  general  but 
also  performance  on  reading  tests.  Different  strategies  useful  for  a  variety 
of  purposes  and  genres  can  be  taught.  Such  strategies  can  develop  readers' 
sensitivity  and  can  thus  improve  comprehension  of  s  passage  in  a  reading  test 
or  any  other  text  (Brown  k  Armbruster,  in  preparation).  Cognitive  strategies 
are  also  important  in  earlier  theories,  such  as  that  of  Gibson  and  Levin 
(1975). 

Nonlinearity.  Readers  use  interactions  among  several  information  levels, 
moving  "top-down”  and  *bottom-up”  and  mixing  higher  order  inference  with  simp¬ 
ler  perceptual  processes.  Because  of  the  nonlinear,  complex  interaction  that 
occurs,  reading  should  not  be  assessed  as  the  automatic  decoding  of  a  sequen¬ 
tial  string  of  letti cs. 

Subskills.  The  last  few  decades  have  seen  a  proliferation  of  subskills 
in  reading  tests  (Parr,  Carey,  (  Tone,  in  preparation),  despite  lack  of  ade¬ 
quate  subskill  definition,  lack  of  a  coherent  reading  theory  from  which  sub- 
skills  derive,  and  lack  of  a  consensus  on  how  many  items  are  needed  to  measure 
a  particular  subskill.  The  new  theory  may  yet  lead  to  a  better  understanding 
of  these  subskills  and  how  they  can  best  be  measured. 
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We  have  discussed  three  types  of  reading  theories:  perceptual-cognitive, 
information  processing,  and  interactive-inferential.  These  theories  all  have 
important  implications  for  reading  assessment.  Some  measure  of  agreement 
exists  across  theories  on  the  importance  of  examinee  context,  background 
knowledge,  and  cognitive  strategies  for  reading.  The  last  theory  seems  to  be 
the  most  helpful  in  offering  concrete  implications  for  reading  measurement. 

We  will  now  discuss  psychometric  qualities  of  available  reading  tests  in  light 
of  these  implications. 


PSYCHOMETRIC  CHARACTERISTICS  OF  AVAILABLE  TESTS  DIFFER 

In  this  section,  the  standards  implied  by  reading  theory  will  be  applied 
to  the  review  of  reading  tests  currently  available  for  Army  use.  This  review 
will  cover  both  commercially  and  militarily  developed  reading  tests.  Theory 
and  practice  tell  us  that  good  reading  tests  should  have  certain  psychometric 
characteristics,  including  appropriate  validity,  reliability,  norms,  and 
standards.  All  reading  theories  imply  the  need  for  valid  measures,  that  is, 
tests  that  measure  what  they  purport  to  measure.  The  emerging  interactive- 
inferential  reading  theory  particularly  emphasizes  the  need  for  a  type  of 
content  validity  (or  at  least  content  relevance)  that  reflects  the  schema  the 
reader  ordinarily  employs.  The  need  for  reliability  of  measurement  is  implied 
in  the  text-linguistic  basis  of  this  new  reading  theory.  Appropriate  norms 
(for  norm-referenced  tests)  and  appropriate  standards  (for  criterion- 
referenced  tests)  are  also  implicit  in  the  emerging  theory. 

Armed  Services  Have  Used  Commercial  Reading  Tests 

Standardized,  norm-referenced,  commercial  reading  tests  have  dominated 
the  armed  forces  market  in  the  past.  Widely  used  tests  have  included  the 
Adult  Basic  Literacy  Examination  (ABLE),  the  Test  of  Adult  Basic  Education 
(TABE),  the  Nelson-Denny  Reading  Test,  and  the  Metropolitan  Achievement  Test 
(MAT),  Reading,  Form  D.  All  of  these  tests  have  been  reviewed  elsewhere 
(Buros,  1972,  1975;  Oxford-Carpenter  fc  Schultz,  1983).  We  will  provide  a 
brief  assessment  of  these  tests  in  light  of  the  standards  implied  by  reading 
theory. 

The  ABLE  is  one  of  a  few  commercially  available,  psychometrically  sound 
reading  tests  originally  designed  for  and  normed  on  adults.  From  a  purely 
technical  standpoint,  the  ABLE  appears  to  be  a  good  commercially  developed 
reading  teat  for  Army  use.  The  ABLE  has  well-documented  reliability  (high 
.80s  and  .90s).  Concurrent  validity  is  based  on  administration  of  the  ABLE 
and  the  Stanford  Achievement  Test  to  elementary  and  junior  high  school  stu¬ 
dents  (.60-. 76)  and  to  an  adult  job  corps  group  (.36-. 72).  The  test  was  de- 
sigr^d  for  adults  with  varying  achievement  levels  and  for  adults  who  have  not 
completed  formal  eighth-grade  education.  Norms  are  based  on  6,000  elementary 
and  junior  high  school  students,  800  job  corps  members,  and  450  adult  basic 
education  students.  Vocabulary,  reading,  spelling,  and  arithmetic  subtests 
are  available  in  two  parallel  forms.  The  reading  subtest  focuses  on  compre¬ 
hension.  The  ABLE  was  originally  selected  for  use  in  the  Army's  Basic  skills 
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Education  Program  (BSEP).  However,  the  test  proved  unpopular  with  some  BSEP 
personnel  for  reasons  related  to  administration  not  technical  quality  (Raines, 
1983).  The  ABLE  was  replaced  by  the  TABE  for  BSEP  use  about  five  years  ago. 

Though  possessing  some  technical  merit,  the  TABE  was  chosen  more  on  prac¬ 
tical  than  technical  grounds.  The  TABE  covers  reading,  language,  and  arith¬ 
metic;  its  reading  subtest  contains  both  vocabulary  and  comprehension.  The 
test  is  actually  a  revision  of  the  California  Achievement  Test  (CAT),  which 
was  designed  for  children.  TABE  developers  removed  patently  childish  refer¬ 
ences  from  the  CAT  or  changed  them  to  adult  references.  TABE  subtests  have 
adequate  reliability  (in  the  high  .70s  through  .90s).  Validity  documentation 
for  the  TABE  is  not  compelling,  because  its  concurrent  validity  is  based  on  a 
correlation  of  .56  with  the  General  Educational  Development  examination  using 
a  small  sample.  The  idea  of  'inherited*  validity  from  the  CAT  is  spurious. 
Furthermore,  the  TABE  lacks  adult  norms,  and  its  use  in  testing  adults  in  the 
Army  has  drawn  strong  criticism  ( Bachem,  1982).  Clearly,  key  career  decisions 
for  soldiers  should  not  be  made  on  the  basis  of  children's  norms,  such  as 
those  of  the  TABE.  However,  the  TABE  has  been  useful  in  BSEP  for  diagnosing 
soldiers'  strengths  and  weaknesses. 

The  Army  has  frequently  used  the  Nelson-Denny  Reading  Test  and  the  MAT. 
Designed  to  assess  reading  for  college  placement  and  adult  reading  classes, 
the  Nelson-Denny  has  adult  norms.  The  MAT  was  given  a  new.  Army  cover  and 
used  as  the  U.S.  Armed  Forces  Institute  (USAFI)  Reading  Test  with  few  changes 
in  the  civi lian-to-military  transformation.  Its  norm  group  includes  no  adults. 
For  both  tests  reliability  is  strong,  but  validity  is  not.  The  Army  has  also 
occasionally  used  other  commercial  reading  tests,  such  as  the  Gates-McGinitie 
Reading  Test  and  the  Iowa  Test  of  Basic  Skills.  A  review  of  these  commer¬ 
cially  developed  reading  tests  is  presented  in  the  Appendix. 

These  tests  need  to  be  evaluated  on  the  basis  of  whether  they  provide 
appropriate  validity,  reliability,  and  norms  demanded  by  reading  theory  and  by 
practicality.  The  Army  particularly  needs  to  examine  the  validity  of  commer¬ 
cial  reading  tests  in  light  of  Army  needs.  Although  a  given  reading  test  may 
be  highly  valid  for  a  group  of  school  children,  it  may  not  be  valid  for  adults 
in  an  intense,  job-oriented  Army  setting.  Furthermore,  the  issue  of  standards 
is  pertinent  to  commercial  reading  tests  as  used  in  the  Army.  Most  commercial 
reading  tests  are  purely  norm  referenced  and  do  not  advertise  acceptable  stan¬ 
dards  of  performance.  However,  the  Army  has  occasionally  used  these  tests  in 
a  quasi-criterion-ref erenced  way  by  designating  a  given  grade  equivalent  test 
score  as  an  acceptable  minimum  standard  for  soldiers.  For  example,  ninth-grade 
reading  level  has  become  a  magic  number  to  some  Army  leaders.  Before  any 
standards  are  chosen,  those  standards  need  to  be  shown  to  correlate  highly 
with  Army  job  performance  (not  just  with  Army  job  materials),  insufficient 
research  has  been  done  in  this  fruitful  area. 

Armed  Forces  Have  Developed  Their  Own  Reading  Tests 

While  the  armed  forces  have  commonly  used  commercial,  norm-referenced 
reading  tests,  the  military  has  also  created  reading  tests.  The  Army  has  taken 
the  lead  in  much  of  this  test  development.  Throughout  the  1970s  and  the  early 
1980s,  the  Army  worked  on  a  series  of  job-related  reading  tests  for  Army  en¬ 
listed  personnel  (Claudy  &  Caylor,  1982;  Sticht,  1975,  1982;  sticht,  Hooke,  & 
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Cay  lor,  1982;  sticht,  Cay lor,  &  James,  1978;  and  Sticht,  Cay  lor,  Kern,  fc  Fox, 
1971).  These  efforts  produced  the  Job  Reading  Task  Tests  ( JRTT)  and  the  Job 
Reading  Test  { JRT) ,  which  are  MOS-specific  and  emphasize  functional  literacy. 
Unlike  the  JRTT,  the  JRT  is  normed  and  machine-scorable.  As  mentioned 
earlier,  the  Army  is  currently  involved  in  a  massive  test  development  under¬ 
taking  under  the  Job  skills  Education  Program  (JSEP),  which  will  provide 
computerized,  job-related  testing  in  basic  skills.  The  other  armed  services 
have  also  developed  their  own  reading  measures,  but  the  efforts  appear  to  have 
been  independent  rather  than  coordinated.  A  leading  developer  of  Army  reading 
tests  stated  that  the  Army's  own  reading  tests  have  never  been  accepted  be¬ 
cause  of  lack  of  internal  advocacy.  Support  existed  for  test  development  but 
not  for  test  use  (T.  sticht,  personal  communication,  August  16,  1983).  A  re¬ 
view  of  selected  militarily  developed  reading  tests  is  found  in  the  Appendix. 

An  important  merit  of  some  reading  tests  developed  by  the  military  is 
also  one  deficiency:  job-specificity.  The  emerging  reading  theory  encourages 
the  use  of  functional,  job-related  reading  tests.  However,  because  approxi¬ 
mately  350  jobs  exist  in  the  Army  alone,  the  task  of  developing  just  one  test 
form  per  job  is  a  monumental  endeavor.  Of  course,  to  enhance  reliability 
there  should  be  several  test  forms  per  job.  Furthermore,  rapid  technological 
change  in  job  content  means  that  job-related  tests  need  to  be  continually 
updated.  Such  an  endeavor  is  obviously  very  costly  and  difficult.  Other 
options  might  be  to  develop  reading  tests  for  career  management  fields  instead 
of  jobs  or  to  develop  reading  tests  for  only  the  highest  density  jobs.  Even 
these  options  are  not  simple. 

Given  the  issues  associated  with  use  of  commercial  and  military  reading 
tests,  the  Army  has  occasionally  sought  a  surrogate  measure  of  reading,  a  test 
that  can  be  said  to  measure  reading  while  not  actually  measuring  it.  Many  Army 
administrators  have  looked  longingly  toward  the  ASVAB  as  such  a  surrogate 
reading  test. 


ASVAB  SCORES  ARE  CORRELATED  WITH  READING  TEST  SCORES 

Because  most  reading  tests  at  present  cannot  adequately  account  for  the 
role  of  such  factors  as  background  knowledge  and  reasoning  ability,  such  tests 
may  be  seen  as  measures  of  intelligence  and  experience  as  well  as  measures  of 
reading  ability  (Parr,  Carey,  &  Tone,  in  preparation).  Consequently,  it  is 
not  surprising  that  reading  tests  are  often  highly  correlated  with  general 
measures  of  aptitude  or  ■trainability*  (Jensen,  1981),  such  as  the  ASVAB. 

Like  similar  tests,  the  ASVAB  is  *  reading-dependent,*  that  is,  reading  is 
necessary  but  not  sufficient  to  perform  well  on  the  test.  Moderate  to  large 
correlations  have  been  demonstrated  between  the  ASVAB  and  several  reading 
teats.  For  example,  Sticht  (1975)  showed  a  .65  correlation  between  an 
unspecified  reading  test  and  the  Armed  Forces  Qualifying  Test  (AFQT),  which 
covers  four  key  ASVAB  subtests.  The  Job  Reading  Task  Tests  developed  by 
Sticht  for  the  Army  also  have  moderate  (in  the  ,50s  and  .60s)  correlations 
with  the  AFQT  (Sticht,  1975).  Fischl  (1981)  found  that  the  USAFI  Reading  Test 
correlated  .80-. 95  with  various  composites  of  the  ASVAB  and  with  the  total 
ASVAB  for  a  sample  of  600  soldiers,  in  a  recent  investigation  involving  2,385 
Army  and  Marine  recruits,  a  .85  correlation  was  found  between  the  ABLE  and  the 
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General  Technical  (GT)  composite  of  the  ASVAB  (P.  Grafton,  personal  communi¬ 
cation,  August  IS,  1983).  Among  several  groups  of  soldiers  whose  English 
proficiency  was  limited  and  who  were  headed  for  Army  English-as-a-second- 
language  (ESL)  training,  Oxford-Carpenter  (1982)  found  very  low  correlations 
(in  the  teens  and  .20s)  between  the  ABLE  and  a  number  of  ASVAB  composites. 
After  ESL  instruction,  the  correlation  between  the  same  soldiers'  ABLE  and 
ASVAB  scores  rose  to  the  .40s  and  ,50s.  In  summary,  reading  tests  have  been 
shown  to  correlate  moderately  to  highly  with  the  ASVAB  as  long  as  language 
problems  (for  individuals  with  limited  proficiency  in  English)  do  not 
intervene. 

A  large  verbal  element  clearly  exists  in  the  ASVAB  (Valentine,  n.d.),  and 
some  researchers  have  conjectured  that  the  ASVAB  indirectly  measures  reading 
ability.  In  fact,  the  Army  has  devised  a  so-called  literacy  index  for  the 
ASVAB  by  equating  the  ASVAB-GT  with  a  composite  of  three  ABLE  subtests,  read¬ 
ing,  vocabulary,  and  arithmetic  reasoning  (P.  Grafton,  personal  communication, 
August  15,  1983).  The  index  involves  ABLE  grade  equivalent  scores,  which, 
while  apparently  simple,  have  some  complexities  that  are  discussed  next. 


GRADE  EQUIVALENT  SCORES  CAN  BE  MISLEADING 

Grade  equivalent  scores,  popularly  known  in  the  military  as  reading  grade 
levels  when  applied  to  reading  tests,  are  the  most  used  and  abuBed  of  all  score 
types.  The  emerging  interactive-inferential  reading  theory  implies  that  read¬ 
ing  tests  should  be  as  appropriate  as  possible  to  the  examinees.  Therefore, 
use  of  grade  equivalent  scores  with  adults  is  potentially  misleading  (Hiller, 
1973).  Sachem  denounced  the  use  of  grade  equivalent  scores  in  the  military 
setting:  "The  use  of  elementary  school  grade  levels  to  categorize  adult  com¬ 
bat  soldiers  seems  little  short  of  an  insult,  no  matter  how  desperate  their 
need  for  remedial  work  may  be"  (Bachem,  1982,  p.4).  Grade  equivalent  scores 
are  not  applicable  to  high  school  or  sdult  levels  because  these  scores  have 
ambiguous  meaning  beyond  the  earlier  years  of  constant  growth  (Ysseldyke  * 
Hurston,  1982),  and  because  they  compare  adults  not  with  . teir  peers  but  with 
children.  Even  using  grade  equivalent  scores  with  children  can  be  misleading 
(Oxford-Carpenter  A  Schultz,  1983).  scores  other  than  grade  equivalents  ate 
abundant,  and  many  are  potentially  useful  for  reading  assessment  in  the  Army. 
Some  useful  norm-referenced  scores  include  percentile  ranks,  standard  scores, 
stanines,  and  some  types  of  adjusted  gain  scores,  criterion-referenced  scores, 
which  are  frequently  encountered  in  Army  training  programs,  include  number  of 
objectives  mastered  or  passed,  number  of  trials  to  mastery,  and  time  to  mas¬ 
tery.  These  scores  may  also  be  useful  for  Army  reading  measurement.  The 
emerging  theory  of  reading  suggests  that  criterion-referenced  scores  may  be 
very  helpful,  because  criterion-referenced  tests  are  geared  toward  .pecific 
objectives  and  have  relevant  standards  and  may  therefore  be  more  germane  than 
other  types  of  tests.  However,  the  theory  does  not  rule  out  use  of  norm- 
referenced  tests  and  their  associated  scores. 


SIX  MAJOR  CONCLUSIONS  ARE  REACHED 

Six  major  conclusions  nay  be  drawn  from  both  theory  and  practice.  Pirst, 
reading  is  important  in  military  and  civilian  work  life.  Second,  reading  as¬ 
sessment  is  a  highly  visible  and  important  issue  in  the  Army.  Third,  reading 
theories — especially  the  interactive-inferential  theory — can  positively  influ¬ 
ence  reading  measurement  practices  in  the  Army.  Fourth,  reading  tests  differ 
widely  in  terms  of  psychometric  characteristics  and  overall  quality  as  evalu¬ 
ated  using  theory-based  standards.  Fifth,  high  correlations  exist  between  the 
ASVAB  and  various  reading  tests,  although  caution  needs  to  be  exercised  in 
using  any  part  of  the  ASVAB  as  a  reading  surrogate.  Sixth,  alternatives  to 
grade  equivalent  scores  are  available  and  should  be  considered  for  use  by  the 
Army. 
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Table  1 


I 


Review  of  Commercially  Developed  Reading  Tests 


omiow 

on  LATEST  MAIN  INTENDED  INTENDED  TEST 

TEST  SERIES  PUBLISHER  COPYRIQHT  PURPOSE!*)  POPULATION  SCORES  VALIDITY 


Adult  Bam  Learn¬ 
ing  Exim  (ABLE)' 

AND 

Psychological 

Corporation 

ig74 

To  determine  general 
Aducibon  level  ol  adults 
to  evaluate  edu*t  educe- 
bon  progrims 

Mutts  with  achieve 
ment  levels  grades 

1-4,  5-9.  10-12. 
adults  who  have  hot 
completed  lormai  6th 
gride  education 

Grade  equivalents, 
based  on  equating 

ABLE  toStmlord 
Achievement  Test 
(SAT)  usmg  scores  ot 
students  in  grades 

2-7.  authoii  entour¬ 
age  use  ot  local 
norms  (percentiles 
and  stanines) 

Concu'ren!  validity  low  to  moderate 
based  on  ID  administration  ot  ABLE 
and  SAT  to  elementary  and  rumor 
high  students  ( 60-  76).  (2)  adminis¬ 
tration  ol  ABLE  and  SAT  plngnph 
meaning  to  job  corps  group  (36-  72) 

SeiectABLE 

Psychological 

Corporation 

T971 

To  screen  for  determina¬ 
tion  ol  which  ABLE  level 
to  administer 

Same  as  tor  A81E 

Raw  score,  percant 
correct 

INA' 

Carver-Da  rty 
Chunked  Reading 
test 

Revrec  PuDhcition 
(developed  by 
Amencin  Institutes 
lor  Research) 

1S72 

To  measure  mtormahon 
stored  Ounng  raiding 

Grades  9-16  and 
adults 

Efficiency,  accuracy 
and  rate  scores,  no 
standardised  scores 

validity  besi  mewed  as  pilot  study 

Gates-McGmibe 
Fielding  lest 

Houghton  Mftflm 

1978 

To  assess  three  areas  ol 
reading 

Grades  1-12. 

Raw  score  extended 
standard  score 
percenbles 

Content  validity  suitable,  may  have 
high  verbal -K)  component  (has  high 
correlation  with  urge  Thomd*e 
wmai  iG). 

Gray  Of  11  Reading 
Test 

Bobbs-Memi. 

1967 

To  assess  oral  reading 
$W! 

Grades  1-16  and 
adult 

Grade  equivalents 

Dotal  scon  onfy) 

INA' 

Metropolitan 
Achievement  Test1 

Psychological 

Corporation 

1978 

To  assess  achievement 
m  a  number  oi  skjii 
areas 

Grades  25-95  for 
reading  subttst 

Scaled  score  per 
centrles.  stanines. 
grade  equrviients 

Content  validity  good  no  predictive 
ot  other  validity  mentioned  in 
reviews,  test  authors  suggest  local 
Cuinculum  vaM.fr  be  checked 

Nelson  Denrv 
Reading  Test 

Houghton  Mifflin 

1973 

To  assess  reading  tor 
college  piacemem  and 
adult  reading  classes 

Grades  9-16  and 
•dirts 

Percenbles.  grade 
aqurvalents 

Lime  dau  on  concurrent  and  predic¬ 
tive  validity  nothing  on  content  Of 
construct  validity 

R8H  Bisk:  Read¬ 
ing  and  Wonj  Test 

Richardson 

Bellows  Henry  S 
Company 

1969 

To  assess  raeding  com¬ 
prehension  and 
vocabulary 

Disadvantaged  adults 

Parcenwes.  standard 
scores 

Lacks  convincing  va'idity  data,  no 
statistics 

RBH  Test  Ol 

Reading 

Comprehension 

Richardson 

Bellows.  Henry  S 
Company 

1963 

TO  test  reading  com¬ 
prehension  using  six 
articles  related  to  busi¬ 
ness  and  industry 

Business  and 
mdustiy 

Parcentues.  standard 
scoras 

Lacks  convincing  validity  data,  low 
validity  coefficients  (  .17-  45) 

Tests  oi  Adult 

Bisrc  Education 
(TABEI 

CTB/McGrtw-Hiii 

1976 

To  measure  reading  and 

arithmetic  oi 

adults  using  an  adapted 
v*f*on  oi  a  childrens 
test 

Adults  reading  at 
levers  ol  children  in 
grades  2-4 
(Level  E).  4-6 
(LMl  M).  7-9 
(level  D) 

Grade  equnsients 
based  on  California 
Achievement  Tests 
(CAT),  197D 

Content  validity  based  onfy  on  item 
selection  procedures  from  CAT  Dues 
bonable  validity  Concurrent  validity 

56  with  GEO  test 

v.ide  Ringe 
Achievement  Test 

Level  ll* 

Jistak  Assessment 
Systems 

1978 

To  assess  quickJy  three 
discrete  areas  oi 

achievement 

:  years  to  adult 

Standard  scores, 
graded  equivalents, 
percenbles  based  on 
W 

Content  vaWiy  questionable 

1  -  irrto'matMjn  not  Miiibie 

2  -  Review  concerns  reading- related  subtests  only  (e  q..  reeding.  vocabulary.  sp.:img) 

3  -  Timing  for  whole  test 
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Table  1  (Continued) 


Review  of  Commercially  Developed  Reading  Tests 


»HXJP 

vi  summon  him 

ntUASiuTY  non  ms  imdwioual  timing  tconmo  foams  scone  ama*  ouauty 


Sou  n*i*  rt  jy'ty  good 
to  excellent  to;  school 
group  (  87-55)  K>C>  corps 
group  (85-961.  and 
adult  bssrc  education  stu¬ 
dents  ( 91-54)  No  Won 
mation  on  teslreltst  ot 
other  reliability 

Based  on  6000 
elementary  and 
junior  high  stu¬ 
dents.  800  tot) 
corps  members, 
and  *S0  adult  basic 
education  students 

Group 

IP*!  1  145  mm 
level  2  145  mm 
Level  3  207  mm  * 

Hanc  o-  Macnme 

ABB 

Vocabu'aiy.  read-ng 
SpeLmg.  inTT-.menc 

He^s  appear  lecbr'ica'7 
we:1  constructed  Most 
items  relevant  to  ado  ts 
Reading  items  are  cloze 
types  and  do  not  mea 
sure  a!'  facets  of 
reading— jus!  reading 
comprehension 

KR-20  84-07.  SO'U-W 
B6-91 

452  military 
recruits 

Group 

15  mm 

Hand 

One  Form 

Total 

Adequate 

Reliability  besi  viewed  as 
twiot  stu<ty— nol  hiQh 

Totally  unsatisfac¬ 
tory.  means  ano 
standard  deviations 
available  on  at 
college  student 
volunteers' 

Group 

25  mm  optional 

Machine 

ABB 

Efficiency,  accuracy, 
rate 

Chvides  sentences  into 
Chunks  but  ft  ltrout  a 
consistent  meoreticai 
basis  related  to  the 
reading  act 

Alternate  Jorm  reliabi:,ty 

1  72-90} 

Normmg  procedures 
good 

Group 

Le*  A  B  50  mm 
Level  C  50  mm 

Level  CS  7  mm 

Leve’  0  40  nun 

Level  t  4 a  mm 

Hand  or  Machine 

12  3 

Speeaaccuracy. 

vocabulary 

comprehension 

Adequate 

Standard  t"ors  o'  mea 
suremen;  adequate 

Norms  based  on 
very  sma"  samples 
(N*40  oe-  g-ade) 

INA1 

INA’ 

INA’ 

A.  B  C  0 

Comprehension  tola) 

items  satisfactory  to 
good,  ora'  rtacmg  on'y 

I'rternai  cons  stency  rei-a 
Cuiiiy  90.  standaro 
errors  ot  measurement 
excellent 

Norm  group  care 
fully  selected 

G'Oup 

Leve1  P-2  160-170 
mm 

Uvelt  228-235 
mm  . 

Level  t  270  mm  . 
Level  A  285  rnm* 

Hand  or  Machine 

J&  K 

Wb'd  knowledge 
reading,  total  void 
analysis  '.or  language, 
depending  on  test 
level),  spelling,  maihe 
manes  ..rompuiauon. 
concepts  problem  solv 
mg  toul) 

items  wr.nen  lor 
children  no:  adults  Con- 
lent  problems  because 
contem  remote  born 
adult  ide 

High  anemaic  fo'ms 
reliability  *>'  most  sub¬ 
tests  based  on  smai: 
samples,  hoover 

Standardization  OK. 
adult  norms  based 
on  cut  time  admin 
istntion  onfy 

Group 

30-35  mm 

Hand  or  Machine 

ABB 

Vocabulary  compre 
tienswn  total,  rate 

Mostly  *eh  constructed 
items 

LacKS  convincing  reiiaPi'- 
ity  data  1 86  probably  an 
overestimate) 

INA’ 

Group 

25-30  mm 

Hand 

One  Form 

Total 

Items  cover  mostly 
vocabulary  Choice  of 
tfjcabuiary  tested  is 
questionable 

Reii«b*tv  oata  not  ade- 
quate.  r»o  means  and 
stanaahJ  deviations 
given 

Test  probably  *as 
too  easy  for  norm 
group,  norms  not 
adequately 
standardized 

Group 

25-25  mm 

Hand 

One  Form 

Tola) 

INA' 

KR-20  ♦or  *e3ding  test 
section  86-94.  test 
retest  tor  reading  79-85 

No  adu'i  no'ms. 
norms  based  on 
students  m  grades 
2-9 

Group 

Level  E  127  mm 
Leve'  M  209  mm 
Le*:  D  191  mm  1 

Hand 

3  B  4 

Reading  (vocabuiaiy 
comprehension  Iota') 
Anthmetic  (reasoning 
fundamentals  iota>) 
Language  (mechanics 
spelling  iolai)-loi  top 
two  i*«is  Total  (Loca 
'o'  test  also  available) 

items  v*re  wnner  for 
children  but  have  been 
revised  to  omit  pureiy 
chiio.sh  references 

• 

High  reported  reliabilities 
( 98)  a*  suspect 

No  national  normmg 
sample  Norms 
developed  trom 
continuous  age 
dale  identity  and 
nature  Ot  norm 
group  not  clear 

Pan 

individual 

20-30  mm 

Hand 

One  Form 

SpeHmg.  anttimetK. 
read-ng  (based  on 
Clmicai  factor 
analysis') 

Questionable  item  devel 
oprnem  and  qua  r, 
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Table  2 


Review  of  Selected  Military  Developed  Reading  Tests 


DEVELOPER 


TEST  SERIES 

OR 

PUIUIHER 

LATEST 

COPYRIOHT 

MAIN  INTENDED 
PURPOSE  IS) 

INTENDED  TEIT 
POPULATION 

KtMEI 

VALIDITY 

Am  Force  fielding 
Ability  Tesl 
(AFRAT)’ 

U  S  Aar  Force 

INAf 

16  assess  reading  ability 
among  lair  to  excellent 
made  rs  in  the  Air  Force 

5th  grade  through 
college  lever  not 
mlended  for  poor 

readers 

GE.  percentile 

Concurrent  validity  is  7?  with  Cal'tO'- 
nta  Achvt  Test.  7$  with  Nelson 

Denny  Predictive  validity  moderate 
(  «fl  m  predicting  Otlicer  Training 

School  grades.  -  13-  61  in  predicting 
technical  diming  grades)  Compre¬ 
hension  a  better  predretof  than 
vocabulary 

Job  Reaflng  Task 
Tests4 

US  Army 

Circa 

Wl-1973 

To  assess  p'cto'mance 
on  Army  yifc  reading 
tasks  with  items  based 
on  3  military  occupa¬ 
tional  specialities 
(MOS).  lot  research  put- 
poses  onty 

Army  enlisted 
personnel 

Percent  correct, 
reading  grade  level 

Moderate  ( 64-  80|  concurrent  validity 
with  standardised  aclm !  test  Lower 
concurrent  validity  with  AFOT 

Moderate  correlations  with  course 
grides  end  |0b  knowledge  tests 

Job  Read<  ig  Task 
Tests4 

US  Arrrry 

circa  1975 

To  assess  performance 
oo  Army  pb  raiding 

taste  witti  items  based 
on  6  MOS 

Army  enitsted 
personnel 

GE.  percentiles 

Content  validity  good  based  on  items 
from  empinca'ty  determined  tob 
rating  tasks  concurrent  validity 
ranges  69-  79  with  USAFI RGL 
entenon 

Job  Reading  Tests5 

US  Army 

cuca  1962 

To  assess  performance 
on  Army  rob  reading 
tasks  with  items  based 
on  6  MOS 

Army  enlisted 
personnel 

Percentile 

Validity  data  not  convincing  Item 
validity  aptiearc  adequate  Onty  infor¬ 
mation  on  test  validity  is  that  the  con¬ 
tent  is  dtewn  carctu  -v  from  6  MOS 

US  Armed foces 
institute  lUSAFij 
Reading  Test 
(same  as 

Metropolitan! 

1  -  Information  trom  Mathews  (peryonnal  communication.  September  7.  1963)  and  Valentine  (n  d  | 

2  -  Information  not  available  trom  published  source 

3  -  One  form  assumed  unless  more  than  one  o  specified 

4  -  Information  from  Stic  fit  (1975) 

5  -  Information  from  CUudy  6  Caffor  (1962) 
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Table  2  (Continued) 


Review  of  Selected  Military  Developed  Reading  Tests 


SflOUP 

vt  tunun  on  item 

RELIABILITY  NORMS  INDIVIDUAL  TIMINQ  SCORINS  FORMS'  SCORE  AREAS  QUALITY 


interna:  consistency 
reliability  approx  .90  lor 
whole  lest 

Mull  rule  and 
female  Air  force 
i nd  Army  enlistees 

Group 

50  mm  total  test 

Hand  or  Machine 

A.  8 

Comprehension, 
vocabulary,  total 

Vocabulary  and  com¬ 
prehension  items  abort 
average  in  quality 

Test-retest  re'iab  iity 

75-flC  No  Other  reliabi! 
ity  in’ormabon 

INA> 

Group 

INA* 

IN  A1 

Three  forms.  COOk. 
Supply  clerk,  whi¬ 
te  repairman 

Tables  standards  and 
specifications  identifi¬ 
cation  and  description, 
procedural  directions, 
check  points,  fire- 
bona!  Description 

INA> 

KR-21  »-9d.  alternate 
forms  60-  76.  St  01 
msmi  02-96 

750  young  adult 
mate  Army  recruits 

Group 

1  hr  ♦ 

Hand 

A,  B.  C 

Using  index  lo  locate 
intormabon.  extracting 
information  from  tables 
and  narrattve  prose, 
following  procedural 
directions 

free -response,  fill-in- 
blank  items  cause  slow 
scoring 

INAZ 

Norm.no  appears 
adequate 

Group 

30-40  mm 

Machine 

a.  b.  : 

Locating  job  informa¬ 
tion  m  tables,  index, 
graphs  end  narrative, 
forms  completion 

Hem  statistics  appear 
adequate  Multiple 
choice  items 

L9 


